Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition
نویسندگان
چکیده
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملDamped oscillator cepstral coefficients for robust speech recognition
This paper presents a new signal-processing technique motivated by the physiology of human auditory system. In this approach, auditory hair cells are modeled as damped oscillators that are stimulated by bandlimited time domain speech signals acting as forcing functions. Oscillation synchrony is induced by time aligning and three-way coupling of the forcing functions across the individual bands ...
متن کاملEstimation of Cepstral Coefficients for Robust Speech Recognition
Acknowledgement Acknowledgement I would first like to thank my advisor, Dr. Richard Povinelli, whose advice and support have been critical in the development of this work. My thanks also go to Dr. Michael Johnson, whose expertise has been very valuable to me. I also thank my committee members, Drs. George Corliss, Craig Struble, and Edwin Yaz for all their assistance. Thanks go to Dr. Mohamed M...
متن کاملPerceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition
This paper describes a robust feature extraction technique for continuous speech recognition. Central to the technique is the Minimum Variance Distortionless Response (MVDR) method of spectrum estimation. We incorporate perceptual information directly in to the spectrum estimation. This provides improved robustness and computational efficiency when compared with the previously proposed MVDR-MFC...
متن کاملAugmented Cepstral Normalization for Robust Speech Recognition
We proposed an augmented cepstral mean normalization algorithm that differentiates noise and speech during normalization, and computes a different mean for each. The new procedure reduced the error rate slightly for the case of sameenvironment testing, and significantly reduced the error rate by 25% when an environmental mismatch exists over the case of standard cepstral mean normalization.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM Transactions on Audio, Speech, and Language Processing
سال: 2016
ISSN: 2329-9290,2329-9304
DOI: 10.1109/taslp.2016.2545928